CALDERON COMMUTATORS AND THE CAUCHY INTEGRAL ON LIPSCHITZ 
CURVES REVISITED I. FIRST COMMUTATOR AND GENERALIZATIONS 



CAMIL MUSCALU 



Abstract. This article is the first in a series of three papers whose aim is to give new proofs
of the well known theorems of Calderón, Coifman, McIntosh and Meyer [1], [8], [9]. Here we
treat the case of the first commutator of Calderón and some of its generalizations.



1. Introduction 



This is the first paper in a series of three, whose aim is to give new proofs of the well
known theorems of Calderón, Coifman, McIntosh and Meyer [1], [8], [9], which established $L^p$
estimates for the so called Calderón commutators and the Cauchy integral on Lipschitz curves.

We refer the reader to the book of Coifman and Meyer [8] for a description of the history of 
these fundamental analytical objects, the role they play in analysis and the various methods that 
have been further developed to understand these operators, since the appearance of the original 
articles. 

Other expository papers where some of these results are described and connected with other 
parts of mathematics, are the proceedings of the plenary talks at the ICM 1974 in Vancouver 
and the ICM 1978 in Helsinki, given by Fefferman [12] and Calderon [2]. 

Our approach will also turn out to be flexible and generic enough, to allow us to generalize 
these classical results in various new ways. 

The first paper describes the case of the first commutator and its generalizations, the second
one the case of the Cauchy integral on Lipschitz curves and its generalizations and finally,
the third will be devoted to the extension of all these results to the multi-parameter setting of
polydiscs of arbitrary dimension, completely solving along the way an open question of Coifman
from the early eighties.

We naturally start with the first commutator. 

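Given a Lipschitz function $A$ on the real line (so $A' := a \in L^\infty(\mathbb{R})$) one formally defines the
linear operator $C_1(f)$ by the formula

$$C_1 f(x) = \mathrm{p.v.} \int_{\mathbb{R}} \frac{A(x) - A(y)}{(x-y)^2}\, f(y)\, dy \qquad (1)$$

where the meaning of the principal value integral is

$$\lim_{\epsilon \to 0} \int_{\epsilon < |x-y| < 1/\epsilon} \frac{A(x) - A(y)}{(x-y)^2}\, f(y)\, dy \qquad (2)$$

whenever the limit exists. This is the so called first commutator of Calderón. Note that the
simplest particular case is obtained when $A(x) = x$ and $C_1(f)$ becomes the classical Hilbert
transform.

Let us observe that when $a$ and $f$ are Schwartz functions, then (2) makes perfect sense.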
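Indeed, for a fixed $\epsilon > 0$, one can rewrite the corresponding expression in (2) as

$$-\int_{\epsilon<|t|<1/\epsilon} \frac{A(x+t)-A(x)}{t^2}\, f(x+t)\, dt = -\int_{\epsilon<|t|<1/\epsilon} \frac{A(x+t)-A(x)}{t}\, f(x+t)\, \frac{dt}{t} = -\int_{\epsilon<|t|<1/\epsilon} \left( \int_0^1 a(x+\alpha t)\, d\alpha \right) f(x+t)\, \frac{dt}{t}. \qquad (3)$$

Then, write $a$ and $f$ as

$$a(x+\alpha t) = \int_{\mathbb{R}} \widehat{a}(\xi_1)\, e^{2\pi i (x+\alpha t)\xi_1}\, d\xi_1$$

and

$$f(x+t) = \int_{\mathbb{R}} \widehat{f}(\xi)\, e^{2\pi i (x+t)\xi}\, d\xi.$$

Using these formulas in (3), the expression becomes

$$-\int_{\mathbb{R}^2} m_\epsilon(\xi,\xi_1)\, \widehat{f}(\xi)\, \widehat{a}(\xi_1)\, e^{2\pi i x(\xi+\xi_1)}\, d\xi\, d\xi_1 \qquad (4)$$

where

$$m_\epsilon(\xi,\xi_1) := \int_0^1 \left( \int_{\epsilon<|t|<1/\epsilon} e^{2\pi i t(\xi+\alpha\xi_1)}\, \frac{dt}{t} \right) d\alpha,$$

which is bounded uniformly in $\epsilon$ and is known to converge, up to a universal multiplicative constant, to

$$\int_0^1 \mathrm{sgn}(\xi+\alpha\xi_1)\, d\alpha.$$

In particular, the dominated convergence theorem implies that the limit as $\epsilon \to 0$ exists in (4)
and it is equal, up to the same constant, to

$$\int_{\mathbb{R}^2} \left[ \int_0^1 \mathrm{sgn}(\xi+\alpha\xi_1)\, d\alpha \right] \widehat{f}(\xi)\, \widehat{a}(\xi_1)\, e^{2\pi i x(\xi+\xi_1)}\, d\xi\, d\xi_1. \qquad (5)$$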



Because of this formula, one can think of $C_1$ as being a bilinear operator in $f$ and $a$ and we will
denote it from now on by $C_1(f, a)$. The following theorem of Calderón is classical [1].

Theorem 1.1. For every $A' = a \in L^\infty$, the operator $C_1$ extends naturally as a bounded linear
operator from $L^p(\mathbb{R})$ into $L^p(\mathbb{R})$ for every $1 < p < \infty$, satisfying

$$\|C_1(f,a)\|_p \lesssim \|a\|_\infty \cdot \|f\|_p. \qquad (6)$$

At this point, one should observe that the symbol of (5) given by

$$\int_0^1 \mathrm{sgn}(\xi+\alpha\xi_1)\, d\alpha \qquad (7)$$






is not a Marcinkiewicz-Hörmander-Mihlin symbol [25] and as a consequence, the Coifman-Meyer
theorem on paraproducts [8] cannot be applied. More precisely, one can see that away
from the lines $\xi = 0$ and $\xi + \xi_1 = 0$, the symbol (7) is many times differentiable and behaves
like a classical symbol, but along them it is only continuous. The observation on which our
approach is based is that, in spite of this lack of differentiability, when one smoothly restricts
(7) to an arbitrary Whitney square with respect to the origin^1, the Fourier coefficients of the
corresponding function decay at least quadratically. This fact (which will be proved carefully
later on) will reduce the problem to that of proving estimates for the associated bilinear
operators which do not grow too fast with respect to the indices of the Fourier coefficients.
We will see that these upper bounds can grow at most logarithmically, which will be more than
enough to make the final series convergent. This is, in just a few words, the strategy of
our proof.
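The regularity claims about the symbol (7) can be checked by hand, since the integral in $\alpha$ has an elementary closed form: for $\xi_1 \neq 0$ one has $\int_0^1 \mathrm{sgn}(\xi+\alpha\xi_1)\, d\alpha = (|\xi+\xi_1| - |\xi|)/\xi_1$. The following Python sketch is only an illustration and is not part of the original argument; the function names are hypothetical. It compares the closed form with a crude midpoint quadrature and exhibits the kink across the line $\xi = 0$.

```python
def symbol_closed_form(xi: float, xi1: float) -> float:
    """Integral over alpha in [0, 1] of sgn(xi + alpha * xi1), in closed form."""
    if xi1 == 0.0:
        # The integrand does not depend on alpha in this case.
        return float((xi > 0) - (xi < 0))
    # d/dalpha |xi + alpha*xi1| = xi1 * sgn(xi + alpha*xi1), so integrate exactly.
    return (abs(xi + xi1) - abs(xi)) / xi1


def symbol_quadrature(xi: float, xi1: float, steps: int = 20000) -> float:
    """Midpoint-rule approximation of the same integral (sanity check only)."""
    total = 0
    for j in range(steps):
        value = xi + (j + 0.5) / steps * xi1
        total += (value > 0) - (value < 0)
    return total / steps


if __name__ == "__main__":
    # Away from the two singular lines the symbol is locally constant or smooth.
    print(symbol_closed_form(2.0, 1.0))
    # The closed form and the quadrature agree up to the quadrature error.
    print(symbol_closed_form(-1.0, 3.0), symbol_quadrature(-1.0, 3.0))
    # Across xi = 0 (with xi1 = 1) the symbol is continuous but its slope jumps.
    h = 1e-3
    left_slope = (symbol_closed_form(0.0, 1.0) - symbol_closed_form(-h, 1.0)) / h
    right_slope = (symbol_closed_form(h, 1.0) - symbol_closed_form(0.0, 1.0)) / h
    print(left_slope, right_slope)
```

With $\xi_1 = 1$ the closed form reads $1 + \xi - |\xi|$ near $\xi = 0$, which is continuous with one-sided slopes $2$ and $0$; this is the lack of differentiability referred to above.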



Before going any further, let us also remark that if one permutes the two integrations in (5),
one can rewrite that expression as

$$\int_0^1 BHT_\alpha(f, a)(x)\, d\alpha$$

where $BHT_\alpha$ is the so called bilinear Hilbert transform of parameter $\alpha$. An alternative approach
to the first commutator (suggested by Calderón) was to prove $L^p \times L^\infty \to L^p$ estimates for
these operators, with implicit constants that are integrable or even uniform in $\alpha$. Estimates for
the bilinear Hilbert transform were first proved by Lacey and Thiele in [15] and [16], and
uniform estimates were later obtained by Thiele [26], Grafakos and Li [13] and Li [17].
It is also interesting to remark that it is not yet known whether such an approach would work
for the second Calderón commutator^2, which this time can be written as

$$\int_{[0,1]^2} THT_{\alpha,\beta}(f, a, a)(x)\, d\alpha\, d\beta.$$

A recent result of Palsson [24] proved many estimates for the operator $\int_0^1 THT_{\alpha,\beta}\, d\alpha$ ($\beta$ is fixed
now), but so far there are no $L^p$ estimates available (uniform or not) for the corresponding
trilinear operator $THT_{\alpha,\beta}$, which has been named by several authors the trilinear Hilbert transform.

Figure 1. The singularities of the symbol of the first commutator

1 These are squares whose sides are parallel to the coordinate axes and whose distance to the origin is comparable
to their sidelengths.

2 The second commutator can similarly be seen as a trilinear operator with symbol $\int_{[0,1]^2} \mathrm{sgn}(\xi + \alpha\xi_1 + \beta\xi_2)\, d\alpha\, d\beta$.

Now coming back to (6), in order to describe the way in which $C_1(f, a)$ can be defined for
any $a \in L^\infty$ and any $f \in L^p(\mathbb{R})$, we need to say a few words about adjoints of bilinear operators.

If $m(\xi_1, \xi_2)$ is a bounded symbol, denote by $T_m(f_1, f_2)$ the bilinear operator given by

$$T_m(f_1,f_2)(x) = \int_{\mathbb{R}^2} m(\xi_1,\xi_2)\, \widehat{f_1}(\xi_1)\, \widehat{f_2}(\xi_2)\, e^{2\pi i x(\xi_1+\xi_2)}\, d\xi_1\, d\xi_2, \qquad (8)$$

for $f_1, f_2$ Schwartz functions. Associated with it is the trilinear form $\Lambda(f_1, f_2, f_3)$ defined by

$$\Lambda(f_1,f_2,f_3) = \int_{\mathbb{R}} T_m(f_1,f_2)(x)\, f_3(x)\, dx,$$

again for $f_1, f_2, f_3$ Schwartz functions.

There are two adjoint operators $T_m^{*1}$ and $T_m^{*2}$ naturally defined by the equalities

$$\int_{\mathbb{R}} T_m^{*1}(f_2,f_3)(x)\, f_1(x)\, dx = \Lambda(f_1,f_2,f_3)$$

and

$$\int_{\mathbb{R}} T_m^{*2}(f_1,f_3)(x)\, f_2(x)\, dx = \Lambda(f_1,f_2,f_3)$$

respectively. It is very easy to observe that both of them are also bilinear multipliers whose
symbols are $m(-\xi_1 - \xi_2, \xi_2)$ and $m(\xi_1, -\xi_1 - \xi_2)$ respectively.

Now, if $a$ and $f$ are Schwartz functions, the inequality (6) is equivalent to

$$\left| \int_{\mathbb{R}} C_1(f,a)(x)\, g(x)\, dx \right| \lesssim \|a\|_\infty \cdot \|f\|_p \cdot \|g\|_{p'} \qquad (9)$$

for any Schwartz function $g$, where $p'$ is the dual index of $p$ (so $1/p + 1/p' = 1$). We also know
from the above that

$$\int_{\mathbb{R}} C_1(f,a)(x)\, g(x)\, dx = \int_{\mathbb{R}} C_1^{*2}(f,g)(x)\, a(x)\, dx. \qquad (10)$$

We are going to prove in the rest of the paper that

$$\|C_1^{*2}(f,g)\|_1 \lesssim \|f\|_p \cdot \|g\|_{p'} \qquad (11)$$

for any Schwartz functions $f$, $g$ and this shows that $C_1^{*2}$ can be extended by density to the whole
$L^p \times L^{p'}$. But then, this means that the right hand side of (10) makes sense for any $a \in L^\infty$,
not only for Schwartz functions, and this suggests extending $C_1(f, a)$ by duality. More
specifically, for $f \in L^p$ and $a \in L^\infty$, one can define $C_1(f,a)$ to be the unique $L^p$ function
satisfying (10) for any $g \in L^{p'}$.
This discussion also proves that to demonstrate Theorem 1.1, we only need to prove (11).
The idea now is to discretize $C_1^{*2}$ and reduce (11) to a discrete finite model.

Acknowledgement: We wish to thank Eli Stein who, after a talk we gave in Pisa, kindly
pointed out to us that the maximal Theorem 4.1 that will enter the picture later on was actually
known and can be found in [25], Chapter II.

The present work has been partially supported by the NSF. 

2. Reduction to a finite localized model 

We start with some standard notations and definitions. An interval $I$ on the real line $\mathbb{R}$ is
called dyadic if it is of the form $I = [2^k n, 2^k(n+1)]$ for some $k, n \in \mathbb{Z}$. We will denote by $\mathcal{D}$ the
set of all such dyadic intervals.

If $I \in \mathcal{D}$, we say that a smooth function $\Phi_I$ is a bump adapted to $I$ if and only if one has

$$|\partial^\alpha \Phi_I(x)| \leq C_{\alpha,N} \cdot \frac{1}{|I|^\alpha} \cdot \frac{1}{\left(1+\frac{\mathrm{dist}(x,I)}{|I|}\right)^N}$$

for every integer $N$ and sufficiently many derivatives $\alpha$, where $|I|$ is the length of $I$. If $\Phi_I$ is a
bump adapted to $I$, we say that $|I|^{-1/p}\Phi_I$ is an $L^p$-normalized bump adapted to $I$, for $1 \leq p \leq \infty$.
Also, if $I = [2^k n, 2^k(n+1)] \in \mathcal{D}$ and $\mathbf{n} \in \mathbb{Z}$ we denote by $I_{\mathbf{n}}$ the new dyadic interval $[2^k(n - \mathbf{n}), 2^k(n + 1 - \mathbf{n})]$
sitting $\mathbf{n}$ units of length $|I|$ away from $I$.

Definition 2.1. A sequence of $L^2$-normalized bumps $(\Phi_I)_I$ adapted to dyadic intervals $I$ is said
to be of $\phi$ type, if and only if for each $I$ there exists an interval $\omega_I$ $(= \omega_{|I|})$ symmetric with respect
to the origin, so that $\mathrm{supp}\, \widehat{\Phi_I} \subseteq \omega_I$ and with $|\omega_I| \sim |I|^{-1}$.

Definition 2.2. A sequence of $L^2$-normalized bumps $(\Phi_I)_I$ adapted to dyadic intervals $I$ is said
to be of $\psi$ type, if and only if for each $I$ there exists an interval $\omega_I$ $(= \omega_{|I|})$ so that $\mathrm{supp}\, \widehat{\Phi_I} \subseteq \omega_I$
and with $|\omega_I| \sim |I|^{-1} \sim \mathrm{dist}(0, \omega_I)$.

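Fix now $n_1, n_2$ two integers and $\mathcal{I} \subseteq \mathcal{D}$ a finite arbitrary collection of dyadic intervals.
Consider also three sequences of $L^2$-normalized bumps $(\Phi^1_{I_{n_1}})_{I \in \mathcal{I}}$, $(\Phi^2_{I_{n_2}})_{I \in \mathcal{I}}$, $(\Phi^3_I)_{I \in \mathcal{I}}$ adapted to
$I_{n_1}$, $I_{n_2}$ and $I$ respectively, such that at least two of them are of $\psi$ type.
The following theorem holds.

Theorem 2.3. The bilinear operator defined by

$$T_{\mathcal{I}}(f,g) := \sum_{I \in \mathcal{I}} \frac{1}{|I|^{1/2}} \langle f, \Phi^1_{I_{n_1}} \rangle \langle g, \Phi^2_{I_{n_2}} \rangle \Phi^3_I \qquad (12)$$

is bounded from $L^p \times L^q \to L^r$ for any $1 < p, q < \infty$ and $0 < r < \infty$ so that $1/p + 1/q = 1/r$,
with a bound of type

$$O(\log\langle n_1 \rangle \log\langle n_2 \rangle)$$

depending also implicitly on $p$, $q$ but independent of the cardinality of $\mathcal{I}$ and of the families of
bumps considered (here $\langle n \rangle$ simply means $2 + |n|$).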

As we will see, Theorem 2.3 lies at the heart of our estimates and in the rest of the
section we will show how it implies the desired inequality (11). The idea will be to discretize
$C_1^{*2}$ and show that it can be reduced to operators of type (12). Equivalently, since they both
have the same trilinear form, we will discretize $C_1$ instead. We start with two Littlewood-Paley
decompositions and write

$$1(\xi) = \sum_{k_1} \widehat{\Psi_{k_1}}(\xi)$$

and

$$1(\xi_1) = \sum_{k_2} \widehat{\Psi_{k_2}}(\xi_1)$$

where as usual, $\widehat{\Psi_{k_1}}(\xi)$ and $\widehat{\Psi_{k_2}}(\xi_1)$ are supported in the regions $|\xi| \sim 2^{k_1}$ and $|\xi_1| \sim 2^{k_2}$
respectively. In particular, we get

$$1(\xi,\xi_1) = \sum_{k_1,k_2} \widehat{\Psi_{k_1}}(\xi)\, \widehat{\Psi_{k_2}}(\xi_1). \qquad (13)$$

By splitting (13) over the regions where $k_1 \ll k_2$, $k_2 \ll k_1$ and $k_1 \sim k_2$ we obtain the final
decomposition

$$1(\xi,\xi_1) = \sum_k \widehat{\Phi_k}(\xi)\, \widehat{\Psi_k}(\xi_1) \; + \qquad (14)$$

$$\sum_k \widehat{\Psi_k}(\xi)\, \widehat{\Phi_k}(\xi_1) \; + \qquad (15)$$

$$\sum_{k_1 \sim k_2} \widehat{\Psi_{k_1}}(\xi)\, \widehat{\Psi_{k_2}}(\xi_1). \qquad (16)$$

By inserting this into (5), $C_1(f,a)$ splits as a sum of three different expressions. It is easy
to see that the symbol of the one corresponding to (15) is a classical symbol and for this part
the inequality (11) follows from the Coifman-Meyer theorem on paraproducts [8]. We are thus
left with understanding the other two terms. Notice that the first one (corresponding to (14))
interacts with the line $\xi = 0$, while the third one (corresponding to (16)) interacts with the line
$\xi + \xi_1 = 0$, along which the original symbol

$$\int_0^1 \mathrm{sgn}(\xi+\alpha\xi_1)\, d\alpha$$

is only continuous. Also, for simplicity, from now on we will replace $\int_0^1 \mathrm{sgn}(\xi+\alpha\xi_1)\, d\alpha$ with
$\int_0^1 1_{\mathbb{R}_+}(\xi+\alpha\xi_1)\, d\alpha$ since the difference of the corresponding operators is just the product of $a$
and $f$ which clearly satisfies the original Hölder type inequalities.

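Let us now fix a parameter $k \in \mathbb{Z}$ and consider the corresponding expressions (also, since
$k_1 \sim k_2$ we assume that they are equal, for simplicity). Their trilinear forms are given by

$$\int_{\xi+\xi_1+\xi_2=0} \left[ \int_0^1 1_{\mathbb{R}_+}(\xi+\alpha\xi_1)\, d\alpha \right] \widehat{\Phi_k}(\xi)\, \widehat{\Psi_k}(\xi_1)\, \widehat{\Psi_k}(\xi_2)\, \widehat{f}(\xi)\, \widehat{g}(\xi_1)\, \widehat{h}(\xi_2)\, d\xi\, d\xi_1\, d\xi_2$$

and

$$\int_{\xi+\xi_1+\xi_2=0} \left[ \int_0^1 1_{\mathbb{R}_+}(\xi+\alpha\xi_1)\, d\alpha \right] \widehat{\Psi_k}(\xi)\, \widehat{\Psi_k}(\xi_1)\, \widehat{\Phi_k}(\xi_2)\, \widehat{f}(\xi)\, \widehat{g}(\xi_1)\, \widehat{h}(\xi_2)\, d\xi\, d\xi_1\, d\xi_2$$

respectively. Clearly, the functions $\widehat{\Psi_k}(\xi_2)$ and $\widehat{\Phi_k}(\xi_2)$ have been inserted naturally into the above
expressions (the first are supported away from zero while the support of the second contains the
origin).

Now, on the support of $\widehat{\Phi_k}(\xi)\widehat{\Psi_k}(\xi_1)$, the function $\int_0^1 1_{\mathbb{R}_+}(\xi+\alpha\xi_1)\, d\alpha$ can be written as a
double Fourier series of type

$$\sum_{n,n_1} C^k_{n,n_1}\, e^{2\pi i \frac{n}{2^k}\xi}\, e^{2\pi i \frac{n_1}{2^k}\xi_1}. \qquad (17)$$

Similarly, on the support of $\widehat{\Psi_k}(\xi)\widehat{\Psi_k}(\xi_1)$ the same function can also be written as

$$\sum_{n,n_1} \widetilde{C}^k_{n,n_1}\, e^{2\pi i \frac{n}{2^k}\xi}\, e^{2\pi i \frac{n_1}{2^k}\xi_1}. \qquad (18)$$

The following Lemma will be crucial and gives upper bounds for these Fourier coefficients.

Lemma 2.4. One has

$$|C^k_{n,n_1}| \lesssim \frac{1}{\langle n \rangle^2 \langle n_1 \rangle^\#}$$

and also

$$|\widetilde{C}^k_{n,n_1}| \lesssim \frac{1}{\langle n \rangle^2 \langle n - n_1 \rangle^\#} + \frac{1}{\langle n \rangle^\# \langle n_1 \rangle^\#}$$

for a fixed large integer $\#$, uniformly in $k$.

We will prove Lemma 2.4 at the end of this section. Roughly speaking, it shows that one
has at least quadratic decay for all the Fourier coefficients.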
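Now, (17) produces expressions of type

$$\int_{\xi+\xi_1+\xi_2=0} \left[ \widehat{\Phi_k}(\xi)\, e^{2\pi i \frac{n}{2^k}\xi} \right] \widehat{f}(\xi) \left[ \widehat{\Psi_k}(\xi_1)\, e^{2\pi i \frac{n_1}{2^k}\xi_1} \right] \widehat{g}(\xi_1)\, \widehat{\Psi_k}(\xi_2)\, \widehat{h}(\xi_2)\, d\xi\, d\xi_1\, d\xi_2$$

which can be rewritten as

$$\int_{\mathbb{R}} (f * \Phi_k^{1,n})(x)\, (g * \Psi_k^{2,n_1})(x)\, (h * \Psi_k^3)(x)\, dx$$

and this can be further discretized by standard arguments as an average of expressions of type

$$\sum_{|I| = 2^{-k}} \frac{1}{|I|^{1/2}} \langle f, \Phi^1_{I_n} \rangle \langle g, \Phi^2_{I_{n_1}} \rangle \langle h, \Phi^3_I \rangle \qquad (19)$$

where the functions $\Phi^2_{I_{n_1}}$ and $\Phi^3_I$ are of $\psi$ type.

Similarly, (18) produces expressions of type

$$\int_{\xi+\xi_1+\xi_2=0} \left[ \widehat{\Psi_k}(\xi)\, e^{2\pi i \frac{n}{2^k}\xi} \right] \widehat{f}(\xi) \left[ \widehat{\Psi_k}(\xi_1)\, e^{2\pi i \frac{n_1}{2^k}\xi_1} \right] \widehat{g}(\xi_1)\, \widehat{\Phi_k}(\xi_2)\, \widehat{h}(\xi_2)\, d\xi\, d\xi_1\, d\xi_2$$

and as we have seen, these can be further rewritten and discretized again in the form (19), where
this time $\Phi^1_{I_n}$ and $\Phi^2_{I_{n_1}}$ are of $\psi$ type. The connection with (12) should be clear by now. If we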
add all the expressions of type (19) for all the scales k e Z, one obtains a discrete trilinear form 
corresponding to the part of C\ related to (14) (and of course, as we discussed, there is a similar 
one related to (16)). In particular, since we are interested in estimating C\ 2 , its bilinear model is 
of the form 



lei 



which should be rewritten as 



lei 



to be able to compare it better with (12). 

Now, using the fact that C* 2 (/, g) makes perfect sense for Schwartz functions (in fact, it can be 
written as an expression similar to (5)) and by triangle inequality, Fatou's lemma and Theorem 
2.3, it follows that 

\\Cl 2 (f,g)\h < J] su P(lO ■ log < n - m > - log < m > -\\fWp ■ \\g\\p' 

n,n\ k 

which is clearly bounded by $\|f\|_p \cdot \|g\|_{p'}$ as a consequence of the quadratic decay in Lemma 2.4.
This completes the proof of (11).
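To see concretely why logarithmic growth of the operator bounds is harmless against quadratic decay of the coefficients, one can look at a simplified model of the double series. The Python sketch below is only an illustration under stated assumptions: the weights $\log\langle n\rangle \log\langle n_1\rangle / (\langle n\rangle^2 \langle n_1\rangle^2)$ are a hypothetical stand-in for the actual summands (they use the weakest decay exponent, 2, in both indices), and the function names are not from the paper.

```python
import math


def bracket(n: int) -> int:
    """The quantity <n> = 2 + |n| used in Theorem 2.3."""
    return 2 + abs(n)


def model_partial_sum(cutoff: int, decay: int = 2) -> float:
    """Partial sum over |n|, |n1| <= cutoff of the model weights.

    Assumption: logarithmic growth in both indices is paired with
    polynomial decay of order `decay` in both indices.
    """
    total = 0.0
    for n in range(-cutoff, cutoff + 1):
        for n1 in range(-cutoff, cutoff + 1):
            growth = math.log(bracket(n)) * math.log(bracket(n1))
            total += growth / (bracket(n) ** decay * bracket(n1) ** decay)
    return total


if __name__ == "__main__":
    for cutoff in (25, 50, 100, 200):
        # The partial sums increase but stay bounded as the cutoff grows.
        print(cutoff, round(model_partial_sum(cutoff), 4))
```

The partial sums increase with the cutoff but remain bounded, since $\sum_n \log\langle n\rangle / \langle n\rangle^2 < \infty$; the same mechanism makes the series in the estimate above converge.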

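We now describe the proof of Lemma 2.4. We first record the following

Lemma 2.5. One has the following identities
(a) $\partial_\xi^2 \left( \int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha \right) = \delta_0(\xi + \xi_1) - \delta_0(\xi)$.

(b) $\partial_{\xi_1}^2 \left( \int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha \right) = \delta_0(\xi + \xi_1)$.

(c) $\partial_\xi \partial_{\xi_1} \left( \int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha \right) = \partial_{\xi_1} \partial_\xi \left( \int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha \right) = \delta_0(\xi + \xi_1)$.
where $\delta_0$ is the Dirac distribution with respect to the origin.

Proof

This is really straightforward. Let us verify (a) for example. One has

$$\partial_\xi^2 \left( \int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha \right) = \partial_\xi \left( \int_0^{\xi_1} \delta_0(\xi + \alpha)\, d\alpha \right) = \partial_\xi \left( \int_\xi^{\xi + \xi_1} \delta_0(\alpha)\, d\alpha \right) = \delta_0(\xi + \xi_1) - \delta_0(\xi).$$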
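Identity (a) can also be tested numerically in its weak form. Writing $F(\xi, \xi_1) = \int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi+\alpha)\, d\alpha = (\xi+\xi_1)_+ - \xi_+$, the identity says that $\int \varphi''(\xi) F(\xi,\xi_1)\, d\xi = \varphi(-\xi_1) - \varphi(0)$ for every Schwartz function $\varphi$. The sketch below checks this for a Gaussian test function; it is only an illustration, and the function names, the choice of test function and the quadrature parameters are assumptions made here.

```python
import math


def antiderivative(xi: float, xi1: float) -> float:
    """F(xi, xi1): integral over alpha in [0, xi1] of the indicator of {xi + alpha > 0}."""
    return max(xi + xi1, 0.0) - max(xi, 0.0)


def gaussian(x: float) -> float:
    """Test function phi(x) = exp(-x^2)."""
    return math.exp(-x * x)


def gaussian_second_derivative(x: float) -> float:
    """phi''(x) for phi(x) = exp(-x^2)."""
    return (4.0 * x * x - 2.0) * math.exp(-x * x)


def weak_second_derivative(xi1: float, half_width: float = 10.0, steps: int = 200000) -> float:
    """Midpoint approximation of the pairing of phi'' with F(., xi1) over the real line."""
    h = 2.0 * half_width / steps
    total = 0.0
    for j in range(steps):
        xi = -half_width + (j + 0.5) * h
        total += gaussian_second_derivative(xi) * antiderivative(xi, xi1)
    return total * h


if __name__ == "__main__":
    for xi1 in (1.5, -0.7):
        predicted = gaussian(-xi1) - gaussian(0.0)  # pairing with delta(xi + xi1) - delta(xi)
        print(xi1, weak_second_derivative(xi1), predicted)
```

The two printed numbers agree up to the quadrature error for both signs of $\xi_1$, which is the content of (a) tested against one particular $\varphi$.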



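To prove now the estimates in Lemma 2.4, we rewrite (for instance) $\widetilde{C}^k_{n,n_1}$ as

$$\int_{\mathbb{R}^2} \left[ \int_0^1 1_{\mathbb{R}_+}(\xi + \alpha\xi_1)\, d\alpha \right] \Psi(\xi)\, \Psi(\xi_1)\, e^{-2\pi i n \xi}\, e^{-2\pi i n_1 \xi_1}\, d\xi\, d\xi_1 = \int_{\mathbb{R}^2} \left[ \int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha \right] \Psi(\xi)\, \widetilde{\Psi}(\xi_1)\, e^{-2\pi i n \xi}\, e^{-2\pi i n_1 \xi_1}\, d\xi\, d\xi_1, \qquad (20)$$

where $\Psi(\xi)$, $\Psi(\xi_1)$ and $\widetilde{\Psi}(\xi_1)$ are supported away from the origin and are adapted to scale 1
(the change of variables $\alpha \mapsto \alpha\xi_1$ produces the factor $1/\xi_1$, which is absorbed into $\widetilde{\Psi}(\xi_1)$).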

The idea is of course to integrate by parts as much as we can in (20) and keep track of the
upper bounds that one gets in this way. We begin integrating by parts in $\xi$ as much as we can.

Since both $\int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha$ and $\Psi(\xi)$ depend on $\xi$, the $\xi$ derivatives can hit either of the terms.

If the derivative hits twice the term $\int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha$ then, because of the previous Lemma 2.5,
the $\xi$ variable disappears and becomes $-\xi_1$ (notice that $\xi$ cannot be zero in this case) at which
point (20) gets simplified into an expression of type



L 



But this term can be integrated by parts as many times as we wish and this explains the
appearance of the first upper bound for $|\widetilde{C}^k_{n,n_1}|$. If on the contrary, the $\xi$ derivative didn't hit the term
$\int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha$ two times even after many integrations by parts, this means that we already
gained a factor of type $\frac{1}{\langle n \rangle^\#}$, at which moment we stop integrating in $\xi$ and start integrating by
parts in $\xi_1$. As before, if the $\xi_1$ derivatives hit the term $\int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha$ until one reaches
$\delta_0(\xi + \xi_1)$ then $\xi_1$ becomes $-\xi$ and after that one integrates by parts a smooth function obtaining
an upper bound of type $\frac{1}{\langle n \rangle^\# \langle n - n_1 \rangle^\#}$ which is smaller than the previously discussed one.

If finally, the $\xi_1$ derivative didn't hit $\int_0^{\xi_1} 1_{\mathbb{R}_+}(\xi + \alpha)\, d\alpha$ until it becomes $\delta_0(\xi + \xi_1)$, then this
means that it keeps hitting the smooth function of $\xi_1$ in which case we obtain an upper bound
of type $\frac{1}{\langle n \rangle^\#} \cdot \frac{1}{\langle n_1 \rangle^\#}$ as desired.

The second term $C^k_{n,n_1}$ can be treated similarly. One should just remark that in this case the
equality $\xi = -\xi_1$ is impossible and only $\delta_0(\xi)$ remains after integrating by parts, which explains
the slight difference between the two upper bounds.

We are as a consequence left with proving Theorem 2.3. 



3. Proof of Theorem 2.3 
The proof is based on the method introduced in [21] and [22]. 

Let us assume without loss of generality that the families $(\Phi^2_{I_{n_2}})_I$ and $(\Phi^3_I)_I$ are of $\psi$ type
(since all the other possible cases can be treated in a similar way). Fix also $1 < p, q < \infty$ and
$0 < r < \infty$ so that $1/p + 1/q = 1/r$. We will prove that $T_{\mathcal{I}}$ maps $L^p \times L^q \to L^{r,\infty}$ since then (12)
follows easily by standard interpolation arguments.

As usual, it is enough to show that given a measurable set $E \subseteq \mathbb{R}$ with $|E| = 1$, one can find
$E' \subseteq E$ with $|E'| \sim 1$ and so that

$$\sum_{I \in \mathcal{I}} \frac{1}{|I|^{1/2}} |\langle f, \Phi^1_{I_{n_1}} \rangle|\, |\langle g, \Phi^2_{I_{n_2}} \rangle|\, |\langle h, \Phi^3_I \rangle| \lesssim (\log\langle n_1 \rangle)(\log\langle n_2 \rangle) \qquad (21)$$

where $h := \chi_{E'}$.

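Define now the shifted maximal operator $M^{n_1}$ and the shifted square function $S^{n_2}$ as follows.

$$M^{n_1} f(x) := \sup_{x \in I} \frac{1}{|I|} \int_{\mathbb{R}} |f(y)|\, \widetilde{\chi}_{I_{n_1}}(y)\, dy$$

where $\widetilde{\chi}_{I_{n_1}}(y)$ denotes the function

$$\widetilde{\chi}_{I_{n_1}}(y) = \left( 1 + \frac{\mathrm{dist}(y, I_{n_1})}{|I_{n_1}|} \right)^{-100}$$

while $S^{n_2}$ is given by

$$S^{n_2} g(x) := \left( \sum_{I \in \mathcal{I}} \frac{|\langle g, \Phi^2_{I_{n_2}} \rangle|^2}{|I|}\, \chi_I(x) \right)^{1/2}.$$

As we will see later on, both of them are bounded on every $L^p$ space for $1 < p < \infty$, with a
bound of type $O(\log\langle n_1 \rangle)$ and $O(\log\langle n_2 \rangle)$ respectively. Using these two facts we define
an exceptional set as follows.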

First, define the set Q by 

Qo := {x : M n> f(x) > Clog < m >} U {x : S n2 f(x) > Clog < n 2 >} . 
Let now d a positive integer and # an integer so that 2 d < |#| < 2 d+l . Define the set £l d # by 



a d # :=[x: M ai ~ # f(x) > Clog < m - # > 2 5d ] 



and then the set Q by 



Q ° := U U Q #- 

d>0 2 d <\#\<2 d+[ 

Define also the set Q, in a similar way to Q but by using the function g and the corresponding 
index n 2 instead. Then, define Q to be 



fio := ^o U Qo U £} (22) 

and finally, the exceptional set 

Q:=|x:M(l no )(x)> T ^|. 

Observe that $|\Omega| \ll 1$ if $C$ is chosen large enough and this allows us to define the set $E'$ by
$E' := E \setminus \Omega$ and to observe that $|E'| \sim 1$ as desired. To be able to estimate (21) properly, we
split it into two parts as follows:

$$\sum_{I \cap \Omega^c \neq \emptyset} + \sum_{I \cap \Omega^c = \emptyset} := I + II. \qquad (23)$$



Estimates for I. First, we observe that since / nff #0 one has Uffl < r± which means that 

|/n^|>^|/|. 

We now perform three independent stopping time type arguments for the functions /, g, h 
which will be combined carefully later on. 
Define first 



Cli = \x : M ni (/)(*) > 



C log < ni > 



and set 



then define 



and set 



I 1 = {IeI:\lnn l \>—\I\\, 



Q 2 = \x:M^(f)(x)> 



C log < ni > 

22 



I 2 = \l€l\I 1 : |/nQ 2 |> 



1 



100 



|/| , 



and so on. The constant C > here is the one in the definition of the set E' before. Clearly, 
since I is finite, we will run out of dyadic intervals after a while, thus producing the sets ({£2 ra })„ 
and ({/„})„. 



Independently, define 



and set 



then as before define 



and set 



Q\=\x:S n \g)(x)> 



C log < n 2 > 



j; = •(/€/: |/na;i > 



100 



l/l 



Q.' 2 = \x:S n \g)(x)> 



C log < n 2 > ' 
2 2 



T 2 = { I € I \ I\ : |7 n Q! 2 \ > 



1 



100 



|/| 



and so on, producing the finitely many sets $(\Omega'_n)_n$ and $(\mathcal{I}'_n)_n$. Of course, we would like to
have such a decomposition available for $h$ as well. To do this, we first need to construct the
analogue of the set $\Omega_0$ for it. To this end, first pick $N > 0$ a big enough integer such that for
every I e I we have \I n Q"^| > ^|/| where we defined 

Q." N = [x : S(h)(x) > C2 N ). 
Then, similarly to the previous algorithms, we define 

( C2 N 
n': N+l =\x:S(h)(x)> — 

and set 



then define 



and set 



I- N+1 =\izI:\mn': N+1 \> m \l\\, 



( C2 N ) 
W N+2 = lx : S(h)(x) > 



_ N+2 -\IeI\r N+l :\lnQ'! N J> 1 L\I\\j, 

and so on, constructing the finitely many sets ({Q."})„ and ({T^'})„. 
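Using all these decompositions, we can decompose term I further as

$$\sum_{l_1, l_2 > 0,\, l_3 > -N}\ \sum_{I \in \mathcal{I}_{l_1,l_2,l_3}} \frac{1}{|I|^{1/2}} |\langle f, \Phi^1_{I_{n_1}} \rangle|\, |\langle g, \Phi^2_{I_{n_2}} \rangle|\, |\langle h, \Phi^3_I \rangle| \qquad (24)$$

where

$$\mathcal{I}_{l_1,l_2,l_3} := \mathcal{I}_{l_1} \cap \mathcal{I}'_{l_2} \cap \mathcal{I}''_{l_3}.$$

Then, observe that since $I$ belongs to $\mathcal{I}_{l_1,l_2,l_3}$ this means in particular that $I$ has not been selected
at either of the previous $l_1 - 1$, $l_2 - 1$ and $l_3 - 1$ steps respectively, which means that all of
$|I \cap \Omega_{l_1-1}|$, $|I \cap \Omega'_{l_2-1}|$ and $|I \cap \Omega''_{l_3-1}|$ are smaller than $\frac{1}{100}|I|$ or equivalently, that one has

$$|I \cap \Omega^c_{l_1-1}| > \frac{99}{100}|I|, \qquad |I \cap \Omega'^c_{l_2-1}| > \frac{99}{100}|I|$$

and

$$|I \cap \Omega''^c_{l_3-1}| > \frac{99}{100}|I|$$

which implies that

$$|I \cap \Omega^c_{l_1-1} \cap \Omega'^c_{l_2-1} \cap \Omega''^c_{l_3-1}| > \frac{97}{100}|I|. \qquad (25)$$

Using this in (24) one can estimate that expression by

$$\sum_{l_1, l_2 > 0,\, l_3 > -N}\ \sum_{I \in \mathcal{I}_{l_1,l_2,l_3}} \frac{|\langle f, \Phi^1_{I_{n_1}} \rangle|}{|I|^{1/2}} \frac{|\langle g, \Phi^2_{I_{n_2}} \rangle|}{|I|^{1/2}} \frac{|\langle h, \Phi^3_I \rangle|}{|I|^{1/2}}\, |I \cap \Omega^c_{l_1-1} \cap \Omega'^c_{l_2-1} \cap \Omega''^c_{l_3-1}|$$

$$= \sum_{l_1, l_2 > 0,\, l_3 > -N} \int_{\Omega^c_{l_1-1} \cap \Omega'^c_{l_2-1} \cap \Omega''^c_{l_3-1}}\ \sum_{I \in \mathcal{I}_{l_1,l_2,l_3}} \frac{|\langle f, \Phi^1_{I_{n_1}} \rangle|}{|I|^{1/2}} \frac{|\langle g, \Phi^2_{I_{n_2}} \rangle|}{|I|^{1/2}} \frac{|\langle h, \Phi^3_I \rangle|}{|I|^{1/2}}\, \chi_I(x)\, dx$$

$$\lesssim \sum_{l_1, l_2 > 0,\, l_3 > -N} \int_{\Omega^c_{l_1-1} \cap \Omega'^c_{l_2-1} \cap \Omega''^c_{l_3-1} \cap \Omega_{\mathcal{I}_{l_1,l_2,l_3}}} M^{n_1}(f)(x)\, S^{n_2}(g)(x)\, S(h)(x)\, dx$$

$$\lesssim \sum_{l_1, l_2 > 0,\, l_3 > -N} \log\langle n_1 \rangle \log\langle n_2 \rangle\, 2^{-l_1} 2^{-l_2} 2^{-l_3}\, |\Omega_{\mathcal{I}_{l_1,l_2,l_3}}|, \qquad (26)$$

where

$$\Omega_{\mathcal{I}_{l_1,l_2,l_3}} := \bigcup_{I \in \mathcal{I}_{l_1,l_2,l_3}} I.$$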

On the other hand, we also have 

\Qj W3 \<\Q Ih \< 



x : M(xn. )(x) > 



1 



\x:M^(f)(x)> 



Clog <n x > 



2'i 



Similarly, we have 



and also 



1^1* 2* 



for every c > 1 . Here we used the fact that all the operators M n > , S 112 and S are bounded on U 
as long as 1 < s < oo and also that \E' 3 \ ~ 1. In particular, this implies that 



|Q r I < 2 h P dl 2 hq92 2 had3 

h-hh 

for any < 6 U 2 , 3 < 1, such that 6>i + 2 + 3 = 1. 
On the other hand, (26) can be split into 



(27) 



log < ri! > log < n 2 > 



< 

£ 2-^2-^^14- ^ 2-^2-fe2- / 3|Q J;] 

Ji,fe>0,/ 3 >0 h,l 2 >0,0>h>-N 



(28) 



To estimate the first expression in (28) we use the inequality (27) for 9\,9 2 , 03 so that 1 -p0\ > 0, 
1 - q0 2 > and 1 - a0 3 > 0, while to estimate the second term we use (27) for U 9 2 , 3 such 
that 1 - p9\ > 0, 1 - q9 2 > and 1 - a03 < 0. With these choices, the sum in (28) is indeed is 
0(log < ni > log < n 2 >) as desired. This ends the discussion of /. 






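Estimates for II. This term is simpler to estimate, now that we defined our exceptional set so
carefully. Notice that the intervals of interest are those inside $\Omega$. One can split them as $\bigcup_{d \geq 0} \mathcal{I}_d$
where

$$\mathcal{I}_d := \left\{ I \in \mathcal{I} : I \subseteq \Omega \text{ and } 2^d \leq \frac{\mathrm{dist}(I, \Omega^c)}{|I|} < 2^{d+1} \right\}.$$

Observe that for any $d \geq 0$ one has

$$\sum_{I \in \mathcal{I}_d} |I| \lesssim |\Omega| \lesssim 1.$$

Also, for every $I \in \mathcal{I}_d$ one has that $2^d I \cap \Omega^c = \emptyset$ and also there exists $\widetilde{I}$ dyadic and of the
same length, which lies $\#$ steps of length $|I|$ away from $I$ (with $2^d \leq |\#| < 2^{d+1}$), and having the
property that $\widetilde{I} \cap \Omega^c \neq \emptyset$. In particular, this means that $I_{n_1}$ and $I_{n_2}$ are $n_1 - \#$ and $n_2 - \#$ steps of
length $|I|$ away from $\widetilde{I}$. Using all these facts, one can estimate term II by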



K/^K^ M s (29) 

ZjZj 1/11/2 1/11/2 1/11/2 111 ~ ^ y > 

d>0 Iel d 11 11 11 

Z Z Z (log < n i - # >)2 M ( lo g < n 2 - # >)2 5d 2~ Md \I\ < 

d>Q 2 d <\#\<2 d+l Ield 

(log < ri! >)(log < n 2 >) 

by using the trivial fact that 



log < iij - # >< log < n, >< # > 

for j= 1,2. 

The proof is now complete. 

4. Appendix 1: Logarithmic estimates for the shifted maximal function 

The goal of this section is to prove the following theorem that has been used before. This
result can be found in Stein [25], but we decided to give a self-contained proof of it here (which
we (re)discovered independently), not only for the reader's convenience, but also because
some particular notations that will be introduced will be quoted and used later on as well.

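Theorem 4.1. ([25]) For any $n \in \mathbb{Z}$, the shifted maximal function $M^n$ is bounded on every $L^p$
space for $1 < p < \infty$, with a bound of type $O(\log\langle n \rangle)$.

Proof. First, we observe that in order to prove the desired estimates, it is enough to prove them
for the corresponding sharp maximal function $\mathcal{M}^n$ defined by

$$\mathcal{M}^n f(x) := \sup_{x \in I} \frac{1}{|I_n|} \int_{I_n} |f(y)|\, dy \qquad (30)$$

where the supremum is taken only over dyadic intervals.
To see this, fix $x$ and $I$ so that $x \in I$. One can write

$$\frac{1}{|I|} \int_{\mathbb{R}} |f(y)|\, \widetilde{\chi}_{I_n}(y)\, dy \lesssim \sum_{\# \in \mathbb{Z}} \frac{1}{|I_n^\#|} \int_{I_n^\#} |f(y)|\, dy \cdot \frac{1}{\langle \# \rangle^{100}},$$

where $I_n^\#$ is the dyadic interval of the same length as $I_n$ and lying $\#$ steps of length $|I_n|$ away
from it. In particular, using the above and assuming that the theorem holds for $\mathcal{M}^n$, one has

$$\|M^n f\|_p \lesssim \sum_{\# \in \mathbb{Z}} \frac{1}{\langle \# \rangle^{100}} \|\mathcal{M}^{n+\#} f\|_p \lesssim \sum_{\# \in \mathbb{Z}} \frac{1}{\langle \# \rangle^{100}} \log\langle n + \# \rangle\, \|f\|_p \lesssim \sum_{\# \in \mathbb{Z}} \frac{1}{\langle \# \rangle^{100}} \log(\langle n \rangle \langle \# \rangle)\, \|f\|_p \lesssim \log\langle n \rangle\, \|f\|_p$$

as desired. We are then left with proving the theorem for $\mathcal{M}^n$.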
Let now $\lambda > 0$. We claim that one has the following inequality

$$|\{x : \mathcal{M}^n f(x) > \lambda\}| \lesssim (\log\langle n \rangle)\, |\{x : Mf(x) > \lambda\}| \qquad (31)$$

where $M$ is the classical Hardy-Littlewood maximal operator. Assuming (31), the theorem for
$\mathcal{M}^n$ follows from the Hardy-Littlewood theorem by interpolation with the trivial $L^\infty$ estimate.

To finally prove (31) denote by $\mathcal{I}_n$ the collection of all dyadic intervals $I_n$, maximal with respect to
inclusion, for which

$$\frac{1}{|I_n|} \int_{I_n} |f(y)|\, dy > \lambda.$$

Note that all of them are disjoint and one also has

$$\bigcup_{I_n \in \mathcal{I}_n} I_n = \{x : Mf(x) > \lambda\}.$$

Then, for every such selected maximal dyadic interval $I_n$, consider its dyadic subintervals
of length $|I_n|$, $|I_n|/2$, $|I_n|/2^2$, etc. Observe that there exist only $[\log\langle n \rangle]$ disjoint
dyadic intervals $I_n^1, I_n^2, \dots, I_n^{[\log\langle n \rangle]}$ of the same length as $I_n$, so that the translate with $-n$
corresponding units of any such smaller dyadic subinterval of $I_n$ becomes a subinterval of one
of these $I_n, I_n^1, \dots, I_n^{[\log\langle n \rangle]}$. The claim is now that

$$\{x : \mathcal{M}^n f(x) > \lambda\} \subseteq \bigcup_{I_n \in \mathcal{I}_n} \left( I_n \cup I_n^1 \cup \dots \cup I_n^{[\log\langle n \rangle]} \right).$$

To see this, pick $x_0$ so that $\mathcal{M}^n f(x_0) > \lambda$. Then, this means that there exists a dyadic interval
$J$ containing $x_0$, so that $\frac{1}{|J_n|} \int_{J_n} |f(y)|\, dy > \lambda$. Because of the previous construction, one can for
sure find one selected maximal interval of type $I_n$, so that $J_n \subseteq I_n$. But then, this means in
particular that $J$ itself will be a subset of either $I_n$ or $I_n^1$ or ... or $I_n^{[\log\langle n \rangle]}$ which implies the
claim.

It is now easy to see that this claim together with the disjointness of the maximal intervals $I_n$
imply (31). The proof is then complete.^3
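The counting step in this proof (only about $\log\langle n \rangle$ intervals of the top length are needed) can be explored with a short experiment. The Python sketch below is only an illustration under stated assumptions: it normalizes the selected interval to $[0,1)$, takes the shift to mean translation by $n$ units of the subinterval's own length (the opposite sign convention gives the mirror image and the same count), truncates the subdivision at a finite depth, and uses a hypothetical function name. It lists the unit dyadic intervals $[q, q+1)$ that contain some translated subinterval.

```python
import math


def parents_of_translates(shift: int, max_depth: int = 12) -> set:
    """Indices q of the unit intervals [q, q+1) hit by translated dyadic subintervals.

    The subinterval [m, m+1) * 2**(-depth) of [0, 1) is moved back by `shift`
    units of its own length; floor division finds the unit interval containing it.
    """
    parents = set()
    for depth in range(max_depth + 1):
        for m in range(2 ** depth):
            left_endpoint_index = m - shift
            parents.add(left_endpoint_index // (2 ** depth))
    return parents


if __name__ == "__main__":
    for shift in (0, 1, 5, 100, 1023):
        count = len(parents_of_translates(shift))
        # Compare the count with log2<n>, where <n> = 2 + |n|.
        print(shift, count, round(math.log2(2 + abs(shift)), 2))
```

In this experiment the number of distinct unit intervals grows like a constant multiple of $\log_2\langle n \rangle$; at most two new intervals appear at each scale where the shift exceeds one full unit, and all finer scales land in the same two neighbouring intervals.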



3 Of course, since the trivial $L^\infty$ estimate comes with an $O(1)$ bound, by interpolation the $L^p$ operatorial bound
of $M^n$ will be even $O((\log\langle n \rangle)^{1/p})$. But for simplicity, we used the $O(\log\langle n \rangle)$ bound all the time.
5. Appendix 2: Logarithmic estimates for the shifted square function



The goal of this last section is to prove the following theorem which played an important role 
earlier in the argument 4 . 

Theorem 5.1. For any n e Z, the shifted square function S n is bounded on every LP space for 
1 < p < oo, with a bound of type 0(\og < n >). 

Proof 

Besides the observations of the previous section, the proof is based on a classical Calderon- 
Zygmund decomposition [25]. 

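First, let us observe that $S^n$ is bounded on $L^2$ with a bound independent of $n$. Indeed, one
can see that

$$\|S^n f\|_2 = \left( \sum_I |\langle f, \Phi_{I_n} \rangle|^2 \right)^{1/2}$$

which is clearly comparable to the $L^2$ norm of the classical Littlewood-Paley square function,
which is known to be bounded on $L^2$.
Next, we show that

$$\|S^n f\|_{1,\infty} \lesssim (\log\langle n \rangle)\, \|f\|_1 \qquad (32)$$

or more specifically that

$$|\{x \in \mathbb{R} : S^n f(x) > \lambda\}| \lesssim \frac{\log\langle n \rangle}{\lambda}\, \|f\|_1 \qquad (33)$$

for every $\lambda > 0$.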



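Fix such a $\lambda > 0$ and perform a Calderón-Zygmund decomposition of the function $f$ at level $\lambda$.
Pick one by one maximal dyadic intervals $J$ so that

$$\frac{1}{|J|} \int_J |f(y)|\, dy > \lambda.$$

Observe that all these intervals are by construction disjoint and denote their union by $\Omega$. One
has

$$|\Omega| = \sum_J |J| \leq \frac{1}{\lambda} \sum_J \int_J |f(y)|\, dy \leq \frac{1}{\lambda} \|f\|_1. \qquad (34)$$

Split now the function $f$ as

$$f = g + b$$

where

$$g := f \chi_{\Omega^c} + \sum_J \left( \frac{1}{|J|} \int_J f(y)\, dy \right) \chi_J$$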

4 It may very well be that this result has been observed before (as it was the case with the previous shifted 
maximal function) but since we didn't find it in the literature, we include a selfcontained proof of it in what 
follows. 
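and

$$b := f - g := \sum_J b_J$$

where

$$b_J := \left( f - \frac{1}{|J|} \int_J f(y)\, dy \right) \chi_J.$$

Clearly, $\mathrm{supp}\, b_J \subseteq J$. Observe also that one has

$$|f(x)| \leq \lambda$$

for almost every $x \in \Omega^c$ and as a consequence,

$$\|g\|_\infty \lesssim \lambda$$

since one also observes that

$$\frac{1}{|J|} \int_J |f(y)|\, dy \leq \frac{2}{|\widetilde{J}|} \int_{\widetilde{J}} |f(y)|\, dy \leq 2\lambda$$

where $\widetilde{J}$ is the unique dyadic interval containing $J$ and twice as long. It is also important to
observe that

$$\int_{\mathbb{R}} b_J(y)\, dy = 0$$

by definition and also that

$$\|b_J\|_1 = \int_J |b_J(y)|\, dy \leq \int_J |f(y)|\, dy + \left( \frac{1}{|J|} \int_J |f(y)|\, dy \right) |J| \lesssim \int_J |f(y)|\, dy \lesssim \lambda |J|$$

as we have seen.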
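The decomposition just described can be tried out on sampled data. The Python sketch below is a discrete toy version and not the construction used in the proof itself: it assumes that the function is given by $2^m$ samples on a single top dyadic interval whose own average does not exceed the level, and the function name is hypothetical. It selects the maximal dyadic blocks with average above the level by a top-down search and then forms the good and bad parts.

```python
def calderon_zygmund(samples, level):
    """Dyadic Calderon-Zygmund decomposition of a sampled function at height `level`.

    Assumptions: len(samples) is a power of two and the average of |samples|
    over the whole array is at most `level`.
    Returns (selected, good, bad) with samples[i] == good[i] + bad[i].
    """
    selected = []

    def search(lo, hi):
        average = sum(abs(v) for v in samples[lo:hi]) / (hi - lo)
        if average > level:
            # Every ancestor was rejected, so this block is maximal.
            selected.append((lo, hi))
            return
        if hi - lo > 1:
            mid = (lo + hi) // 2
            search(lo, mid)
            search(mid, hi)

    search(0, len(samples))

    good = list(samples)
    for lo, hi in selected:
        mean = sum(samples[lo:hi]) / (hi - lo)
        good[lo:hi] = [mean] * (hi - lo)  # replace f by its average on each selected block
    bad = [f - g for f, g in zip(samples, good)]  # mean zero on each selected block
    return selected, good, bad


if __name__ == "__main__":
    samples = [0.0, 0.0, 8.0, 0.0, 1.0, 1.0, 0.0, 0.0]
    selected, good, bad = calderon_zygmund(samples, level=2.0)
    print(selected)
    print(good)
    print(bad)
```

On this input the only selected block is the one covering the third and fourth samples; the good part is bounded by twice the level, the bad part has mean zero on the selected block, and the total length of the selected blocks is at most the $\ell^1$ norm of the samples divided by the level, mirroring (34).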

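Using all these properties, one can write

$$|\{x \in \mathbb{R} : S^n f(x) > \lambda\}| \leq |\{x \in \mathbb{R} : S^n g(x) > \lambda/2\}| + |\{x \in \mathbb{R} : S^n b(x) > \lambda/2\}|. \qquad (35)$$

To estimate the first term in (35), we use the $L^2$ boundedness of $S^n$ and we write

$$|\{x \in \mathbb{R} : S^n g(x) > \lambda/2\}| \lesssim \frac{1}{\lambda^2} \|S^n g\|_2^2 \lesssim \frac{1}{\lambda^2} \|g\|_2^2 = \frac{1}{\lambda^2} \int_{\mathbb{R}} |g(x)|^2\, dx \lesssim \frac{1}{\lambda} \int_{\mathbb{R}} |g(x)|\, dx = \frac{1}{\lambda} \|g\|_1 \leq \frac{1}{\lambda} \left( \int_{\Omega^c} |f(x)|\, dx + \sum_J \int_J |f(x)|\, dx \right) \lesssim \frac{1}{\lambda} \|f\|_1,$$

as desired.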

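To estimate the second term in (35), we proceed as follows. First, for any interval $J$, consider
the associated $J^1, J^2, \dots, J^{[\log\langle n \rangle]}$ as defined in the previous section and define the set $\Omega_J$ by

$$\Omega_J := 5J \cup 5J^1 \cup 5J^2 \cup \dots \cup 5J^{[\log\langle n \rangle]}.$$

Then, one has

$$|\{x \in \mathbb{R} : S^n b(x) > \lambda/2\}| \leq \Big| \Big\{ x \in \bigcup_J \Omega_J : S^n b(x) > \lambda/2 \Big\} \Big| + \Big| \Big\{ x \in \Big( \bigcup_J \Omega_J \Big)^c : S^n b(x) > \lambda/2 \Big\} \Big|. \qquad (36)$$

The first expression is easy to estimate since one can write

$$\Big| \Big\{ x \in \bigcup_J \Omega_J : S^n b(x) > \lambda/2 \Big\} \Big| \leq \Big| \bigcup_J \Omega_J \Big| \lesssim (\log\langle n \rangle) \sum_J |J| \lesssim (\log\langle n \rangle)\, \frac{1}{\lambda} \|f\|_1,$$

as we have seen before. The second expression in (36) can be majorized by

$$\frac{2}{\lambda} \int_{(\bigcup_J \Omega_J)^c} S^n b(x)\, dx \leq \frac{2}{\lambda} \sum_J \int_{(\bigcup_{J'} \Omega_{J'})^c} S^n b_J(x)\, dx \leq \frac{2}{\lambda} \sum_J \int_{(\Omega_J)^c} S^n b_J(x)\, dx$$

and we claim now that for any $J$ one has

$$\int_{(\Omega_J)^c} S^n b_J(x)\, dx \lesssim \lambda |J|. \qquad (37)$$

Assuming (37), one can continue the previous inequality and further majorize it by

$$\frac{1}{\lambda} \sum_J \lambda |J| = \sum_J |J| \lesssim \frac{1}{\lambda} \|f\|_1$$

as desired.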

We are then left with proving our claim (37). First, we majorize the left hand side of it by 



(37) 



\(bj,%J\ 



y f \{bj,Oi a )\ 



h(x)\dx=y f 

h(x))dx+ Y f 



\(bj,®i B )\ 



|/| 



1/2 



\i(x))dx = 



\<bj,<s> In )\ 

| /N .,l"(n 7 )c l'l 1/2 



l/(x))J^; : = 



A + B. 



Estimating A. The main observation here is to realize that since |/| < |7| and / n (Q j) c £ 0, one 
must in particular have I n n 37 = 0. This allows one to estimate A by 



I 



1 + 



|/„I<|/| 
as required by (37) 



dist(/ n , 7) 
|/„l 



-10 



|/nl<W 



dist(/ n , 7) 



\bj(y)\dy < A\J\ 1 1 + | < i|7|, 
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Estimating B. This time, one has to take into account that fact that 
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As before, one can estimate B by 



bj(y)dy = 0. 



E K**®^' 

|/nl>W 



(38) 



where this time O^ := |/ n | 1/2 <D/ n is an L°° normalized bump. In order to emphasize that the 
dependence of n is irrelevant now, we rewrite the above expression as 



E K^^>l 



\K\>\J\ 



where the sum is over dyadic intervals K. 
Fix K such that \K\ > |7| and observe that 



\(bj,®K)\ 



bj(z)®-(z)dz 



where c j denotes the midpoint of the interval J. 
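Then, observe that for $z \in J$, one has
\[
|\widetilde{\Phi}_K(z) - \widetilde{\Phi}_K(c_J)| \lesssim |J|\,\frac{1}{|K|}\Big(1 + \frac{\mathrm{dist}(K, J)}{|K|}\Big)^{-10}
\]
and so the previous term becomes smaller than
\[
|J|\,\frac{1}{|K|}\Big(1 + \frac{\mathrm{dist}(K, J)}{|K|}\Big)^{-10} \int_{\mathbb{R}} |b_J(y)|\,dy \lesssim |J|\,\frac{1}{|K|}\Big(1 + \frac{\mathrm{dist}(K, J)}{|K|}\Big)^{-10} \lambda |J|.
\]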



Finally, the corresponding estimate (37) follows from the straightforward observation that
\[
\sum_{|K| > |J|} \frac{|J|}{|K|}\Big(1 + \frac{\mathrm{dist}(K, J)}{|K|}\Big)^{-10} \lesssim 1.
\]
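The summability used in this last step can be checked numerically. The following is only a sketch under simplifying assumptions that are ours, not the paper's: $J$ is normalized to $[0,1)$, the intervals $K$ are the dyadic intervals $[m 2^k, (m+1) 2^k)$ with $k \ge 1$, and the helper names `dist_to_unit_interval` and `tail_sum` are hypothetical.

```python
def dist_to_unit_interval(left, right):
    """Distance from the interval [left, right) to J = [0, 1)."""
    return max(0.0, left - 1.0, -right)


def tail_sum(max_scale, max_shift, decay=10):
    """Truncated version of sum_{|K| > |J|} (|J|/|K|) (1 + dist(K, J)/|K|)^(-decay).

    K ranges over dyadic intervals [m * 2**k, (m + 1) * 2**k) with
    1 <= k <= max_scale and |m| <= max_shift, and J = [0, 1).
    """
    total = 0.0
    for k in range(1, max_scale + 1):
        size = float(2 ** k)
        for m in range(-max_shift, max_shift + 1):
            left = m * size
            right = left + size
            d = dist_to_unit_interval(left, right)
            total += (1.0 / size) * (1.0 + d / size) ** (-decay)
    return total


if __name__ == "__main__":
    # The partial sums stabilize quickly as more scales and shifts are included.
    for scales in (5, 10, 20, 40):
        print(scales, tail_sum(scales, 50))
```

At each scale only the few intervals close to $J$ contribute noticeably, and the factor $|J|/|K| = 2^{-k}$ makes the sum over scales geometric, which is why the truncated sums stay bounded.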



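By interpolating between $L^2$ and weak-$L^1$ we obtain the theorem for any $1 < p < 2$. To prove the rest of the estimates we proceed as usual, by duality. Fix then $2 < p < \infty$. By using Khinchin's inequality, one can write
\[
\|S_n f\|_p^p = \int_{\mathbb{R}} \Big(\sum_I \frac{|\langle f, \Phi_{I_n}\rangle|^2}{|I|}\, 1_I(x)\Big)^{p/2} dx
\lesssim \int_{\mathbb{R}} \int_0^1 \Big|\sum_I r_I(t)\,\langle f, \Phi_{I_n}\rangle\, h_I(x)\Big|^p dt\,dx
= \int_0^1 \Big\|\sum_I r_I(t)\,\langle f, \Phi_{I_n}\rangle\, h_I\Big\|_p^p\, dt, \tag{39}
\]
where $(r_I)_I$ are the Rademacher functions and $(h_I)_I$ the $L^2$-normalized Haar functions.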
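As a small illustration of the Khinchin step, the sketch below compares the averaged $p$-th moment of a random-sign sum with the corresponding power of the square function for a finite family of coefficients. It is an assumption-laden toy model: the Rademacher functions on $[0,1]$ are replaced by an enumeration of all sign patterns (which has the same distribution for finitely many terms), and the names `rademacher_moment` and `square_function_power` are hypothetical.

```python
from itertools import product


def rademacher_moment(coeffs, p):
    """Average of |sum_i eps_i * a_i|^p over all sign patterns eps_i = +-1."""
    total = 0.0
    for signs in product((-1, 1), repeat=len(coeffs)):
        total += abs(sum(s * a for s, a in zip(signs, coeffs))) ** p
    return total / 2 ** len(coeffs)


def square_function_power(coeffs, p):
    """(sum_i a_i^2)^(p/2), the quantity Khinchin's inequality compares with."""
    return sum(a * a for a in coeffs) ** (p / 2)


if __name__ == "__main__":
    coeffs = [1.0, -2.0, 0.5, 3.0]
    for p in (1, 2, 4, 6):
        ratio = rademacher_moment(coeffs, p) / square_function_power(coeffs, p)
        print(p, ratio)
```

For $p = 2$ the two quantities coincide by orthogonality of the signs; for other values of $p$ the ratio stays between constants depending only on $p$, which is the content of the inequality used in (39).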
Fix now $t \in [0, 1]$ and consider the linear operator
\[
f \mapsto \sum_I r_I(t)\,\langle f, \Phi_{I_n}\rangle\, h_I.
\]

Using the fact that $S_n$ and the Littlewood-Paley square function associated to $(h_I)_I$ are bounded below $L^2$, an argument identical to the one used to prove Theorem 2.3 shows that the above operator is also bounded below $L^2$ and, by duality, above $L^2$ as well, with bounds independent of $t$ which grow logarithmically in $\langle n\rangle$. Using this fact in (39) completes the proof of the theorem.



6. Generalizations 
Let us first observe that the first commutator $C_1 f$ can also be written as
\[
C_1 f(x) = \mathrm{p.v.} \int_{\mathbb{R}} \Big(\frac{\Delta_t}{t} A(x)\Big) f(x + t)\, \frac{dt}{t} \tag{40}
\]



where $\Delta_t$ is the finite difference operator at scale $t$ given by

\[
\Delta_t g(x) := g(x + t) - g(x).
\]

There is a very simple way to motivate the introduction of this operator. Start with the Leibniz rule identity

\[
(Af)' = A'f + Af'
\]

and solve for $A'f$ to obtain

\[
A'f = (Af)' - Af' = D(Af) - ADf = [D, A]f
\]

where $D$ is the operator of taking one derivative and $A$ is viewed now as the operator of multiplication with the function $A(x)$. In particular, assuming that $A' \in L^\infty$, the commutator $[D, A]$ maps $L^p$ into itself boundedly, for every $1 < p < \infty$. Does this property hold for the operator $[|D|, A]$ as well? one might ask. A straightforward calculation shows that $[|D|, A]$ is precisely the first commutator of Calderon.
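At the level of symbols, and up to normalizing constants, the commutator $[|D|, A]$ acts on a pair of frequencies $(\xi, \xi_1)$ (the frequency of $f$ and of $A$ respectively) through the factor $|\xi + \xi_1| - |\xi|$, and the fundamental theorem of calculus rewrites this as $\xi_1 \int_0^1 \mathrm{sgn}(\xi + \alpha \xi_1)\,d\alpha$, which is how the derivative $A'$ enters. The sketch below checks this elementary identity numerically; the function names and the midpoint quadrature are our own illustrative choices.

```python
def sgn(x):
    """Sign of x, with sgn(0) = 0."""
    return (x > 0) - (x < 0)


def averaged_sign(xi, eta, steps=20000):
    """Midpoint-rule approximation of the integral of sgn(xi + alpha * eta) over [0, 1]."""
    return sum(sgn(xi + (k + 0.5) / steps * eta) for k in range(steps)) / steps


def commutator_symbol(xi, eta):
    """The difference |xi + eta| - |xi|."""
    return abs(xi + eta) - abs(xi)


if __name__ == "__main__":
    for xi, eta in [(1.0, 2.0), (1.0, -3.0), (-2.0, 0.5), (0.3, -0.4)]:
        print(xi, eta, commutator_symbol(xi, eta), eta * averaged_sign(xi, eta))
```

The two printed columns agree up to the quadrature error, which comes only from the single cell where the sign changes.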

Given these, it is of course natural to ask what can be said about the double commutator 
[|D|,[|D|,A]] ? 

A direct calculation shows that the expression $[|D|, [|D|, A]](f)(x)$ is equal to

\[
\mathrm{p.v.} \int_{\mathbb{R}^2} \Big(\frac{\Delta_t}{t} \circ \frac{\Delta_s}{s} A(x)\Big) f(x + t + s)\, \frac{dt}{t}\, \frac{ds}{s} \tag{41}
\]

a formula that can be naturally seen as a bilinear operator, this time depending on $f$ and $A''$. Its symbol can be again calculated easily and it is given by

\[
\Big(\int_0^1 \mathrm{sgn}(\xi + \alpha \xi_1)\, d\alpha\Big)^2
\]

which is precisely the square of the symbol of the first commutator of Calderon.

Theorem 6.1. Let $a \neq 0$ and $b \neq 0$ and consider the expression

\[
\mathrm{p.v.} \int_{\mathbb{R}^2} \Big(\frac{\Delta_{at}}{t} \circ \frac{\Delta_{bs}}{s} A(x)\Big) f(x + t + s)\, \frac{dt}{t}\, \frac{ds}{s}.
\]

Viewed as a bilinear operator in $f$ and $A''$, it extends naturally as a bounded operator from $L^p \times L^q$ into $L^r$ for every $1 < p, q < \infty$ with $1/p + 1/q = 1/r$ and $1/2 < r < \infty$.

To prove this theorem, one applies the same method described earlier for the first commutator. One just has to observe that the symbol of this operator is given by

\[
\Big(\int_0^1 \mathrm{sgn}(\xi + \alpha a \xi_1)\, d\alpha\Big) \Big(\int_0^1 \mathrm{sgn}(\xi + \alpha b \xi_1)\, d\alpha\Big)
\]

and after that to realize that each factor satisfies the same desired quadratic estimates. So this time one needs to decompose each factor as a double Fourier series as we did before. The fact that one can go all the way down to $1/2$ with the estimates is a simple consequence of the statement that series of type

\[
\sum_{n_1, n_2} |C(n_1, n_2)|^{r} \log\langle n_1\rangle \log\langle n_2\rangle
\]

are always convergent as long as the constants $C(n_1, n_2)$ decay at least quadratically in $n_1$ and $n_2$ and $r > 1/2$. The details are straightforward and are left to the reader. And clearly, one can generalize the above theorem even further, in the most obvious way. We will come back to this in the second paper of the sequel.
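
To make the role of the threshold $r > 1/2$ explicit, here is the model computation, under the assumption (our reading of "decay at least quadratically") that $|C(n_1, n_2)| \lesssim \langle n_1\rangle^{-2} \langle n_2\rangle^{-2}$:

```latex
\sum_{n_1, n_2 \in \mathbb{Z}} |C(n_1, n_2)|^{r} \log\langle n_1\rangle \log\langle n_2\rangle
\;\lesssim\;
\Big( \sum_{n \in \mathbb{Z}} \langle n\rangle^{-2r} \log\langle n\rangle \Big)^{2},
\qquad
\sum_{n \in \mathbb{Z}} \langle n\rangle^{-2r} \log\langle n\rangle < \infty
\iff 2r > 1 .
```

The logarithmic factors do not affect the threshold, since $\log\langle n\rangle \lesssim \langle n\rangle^{\varepsilon}$ for every $\varepsilon > 0$.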

Another generalization we have in mind comes from the following identity 

A'B' = (AB)" - (BA')' - (AB')' + A'B'. (42) 
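
For the reader's convenience, identity (42) can be verified by expanding each term with the Leibniz rule; this is only a routine check:

```latex
\begin{align*}
(AB)'' &= A''B + 2A'B' + AB'', \\
(BA')' &= A'B' + A''B, \\
(AB')' &= A'B' + AB'', \\
(AB)'' - (BA')' - (AB')' &= 0 .
\end{align*}
```

Hence the first three terms on the right hand side of (42) cancel and only $A'B'$ survives.
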
As a consequence of it, the right hand side of (42) satisfies Hölder estimates of type

\[
\|(AB)'' - (BA')' - (AB')' + A'B'\|_r \lesssim \|A'\|_p\, \|B'\|_q
\]

for indices $p, q, r$ as before. Does this inequality continue to hold if one replaces every derivative $D$ by its modulus $|D|$? As before, a direct calculation shows that the new expression

\[
|D|^2(AB) - |D|(B|D|A) - |D|(A|D|B) + (|D|A)(|D|B)
\]

can be rewritten as

\[
\mathrm{p.v.} \int_{\mathbb{R}^2} \Big(\frac{\Delta_t}{t} A(x + s)\Big) \Big(\frac{\Delta_s}{s} B(x + t)\Big)\, \frac{dt}{t}\, \frac{ds}{s}. \tag{43}
\]



The right way to look at this formula is to view it as a bilinear operator in $A'$ and $B'$. Its symbol can be calculated quite easily and it is given by

\[
\Big(\int_0^1 \mathrm{sgn}(\xi_1 + \alpha \xi_2)\, d\alpha\Big) \Big(\int_0^1 \mathrm{sgn}(\xi_2 + \beta \xi_1)\, d\beta\Big) \tag{44}
\]

which is a symmetric function in the variables $\xi_1$ and $\xi_2$. Because of this symmetry we like to call expressions such as the ones in (43) circular commutators. We will return to them in the second paper of the sequel.
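
The calculation behind (44) can be sketched by testing (43) on exponentials $A(x) = e^{i\xi_1 x}$ and $B(x) = e^{i\xi_2 x}$. Multiplicative constants are not tracked here and are absorbed in the symbol $\simeq$; the only input is the standard formula $\mathrm{p.v.}\int_{\mathbb{R}} (e^{iut} - e^{ivt})\, t^{-2}\, dt = -\pi(|u| - |v|)$.

```latex
\begin{align*}
\Delta_t A(x+s)\, \Delta_s B(x+t)
  &= e^{i(\xi_1+\xi_2)x}\,
     \big(e^{i(\xi_1+\xi_2)t} - e^{i\xi_2 t}\big)\,
     \big(e^{i(\xi_1+\xi_2)s} - e^{i\xi_1 s}\big), \\
\mathrm{p.v.}\int_{\mathbb{R}^2} \Delta_t A(x+s)\, \Delta_s B(x+t)\, \frac{dt}{t^2}\, \frac{ds}{s^2}
  &\simeq e^{i(\xi_1+\xi_2)x}\,
     \big(|\xi_1+\xi_2| - |\xi_2|\big)\, \big(|\xi_1+\xi_2| - |\xi_1|\big), \\
|\xi_1+\xi_2| - |\xi_1|
  &= \xi_2 \int_0^1 \mathrm{sgn}(\xi_1 + \alpha\xi_2)\, d\alpha, \\
|\xi_1+\xi_2| - |\xi_2|
  &= \xi_1 \int_0^1 \mathrm{sgn}(\xi_2 + \beta\xi_1)\, d\beta .
\end{align*}
```

Expanding the product in the second line recovers the four terms $|D|^2(AB) - |D|(B|D|A) - |D|(A|D|B) + (|D|A)(|D|B)$, while the last two lines show that the factor $\xi_1 \xi_2$ can be assigned to $A'$ and $B'$, leaving the symbol (44).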

Theorem 6.2. Let $a \neq 0$ and $b \neq 0$ and consider the expression




Viewed as a bilinear operator in $A'$ and $B'$, it extends naturally as a bounded operator from $L^p \times L^q$ into $L^r$ for every $1 < p, q < \infty$ with $1/p + 1/q = 1/r$ and $1/2 < r < \infty$.

The proof uses the same method, since it is not difficult to see that the symbols of such 
bilinear operators are again products of symbols of the first commutator kind and they each 
satisfy the same quadratic estimates. 